Web crawler

Results: 342



# Item
71 Cloud infrastructure / Parallel computing / Apache Hadoop / Hadoop / MapReduce / Nutch / Web crawler / HBase / Amazon Elastic Compute Cloud / Computing / Cloud computing / Concurrent computing

PDF Document

Source URL: media.blackhat.com

Language: English - Date: 2012-04-07 14:20:15
72 Framework Programmes for Research and Technological Development / Cyberwarfare / World Wide Web / Web crawler / Computer security / Vulnerability

Inclusive Growth Research Infrastructure Diffusion: Call for Expert Workshop on the Use of Web Crawling Data in Identifying New Jobs and New Skills. This workshop takes place on 20 October 2014 at CEPS, Brussels. Motivated

Source URL: inclusivegrowth.be

Language: English - Date: 2014-08-08 08:18:40
73 Spamdexing / Spam / Email spam / URL redirection / Web crawler / Search engine optimization / PageRank / Google Search / Spam in blogs / Internet / Spamming / Computing

Spam, Damn Spam, and Statistics: Using statistical analysis to locate spam web pages. Dennis Fetterly, Mark Manasse

Source URL: research.microsoft.com

Language: English - Date: 2004-05-24 20:16:38
74 Web crawlers / World Wide Web / Distributed web crawling / Searching / Search engine indexing / Hypertext Transfer Protocol / Scheduling / Information science / Information retrieval / Computing

Storm Crawler: A real-time distributed web crawling and monitoring framework. Jake Dodd, co-founder, http://ontopic.io

Source URL: events.linuxfoundation.org

Language: English - Date: 2015-04-16 10:43:32
75 Information science / Web crawler / Sitemaps / Archive / Website / Web content / Link rot / Heritrix / World Wide Web / Web archiving / Computing

The UK Government Web Archive: Guidance for digital and records management teams. © Crown copyright 2015. You may re-use this information (excluding logos) free of charge in any format or medium, under

Source URL: nationalarchives.gov.uk

Language: English - Date: 2015-01-29 07:28:53
76 Crowdsourcing / PageRank / Reputation management / Searching / Information retrieval / Information science / Web crawler / Webgraph / Link analysis / Markov models / Search engine optimization

Do Your Worst to Make the Best: Paradoxical Effects in PageRank Incremental Computations (Extended Abstract). Paolo Boldi, Massimo Santini

Source URL: vigna.di.unimi.it

Language: English - Date: 2004-07-07 11:13:31
77 Human–computer interaction / Web design / Search engine optimization / Cache / Web crawler / Web search engine / Web cache / Proxy server / Web archiving / Computing / Internet / World Wide Web

Lazy Preservation: Reconstructing Websites by Crawling the Crawlers. Frank McCown, Joan A. Smith, and Michael L. Nelson. Old Dominion University, Computer Science Department

Source URL: www.cs.odu.edu

Language: English - Date: 2006-10-19 18:07:18
78 Web design / World Wide Web / Search engine optimization / Web crawlers / Information retrieval / Sitemaps / Invisible Web / Web search engine / Web archiving / Computing / Information science / Internet

Evaluation of Crawling Policies for a Web-Repository Crawler. Frank McCown, Michael L. Nelson

Source URL: www.cs.odu.edu

Language: English - Date: 2006-06-09 14:30:27
79 Web crawler / Model-based testing / Shortest path problem / Rich Internet application / Graph / Graph theory / Mathematics / Theoretical computer science

Building Rich Internet Applications Models: Example of a Better Strategy. Suryakant Choudhary, Mustafa Emre Dincturk, Seyed M. Mirtaheri, Guy-Vincent Jourdan, Gregor v. Bochmann, and Iosif Viorel Onut

Source URL: ssrg.site.uottawa.ca

Language: English - Date: 2013-04-30 15:56:24
80 Information retrieval / Searching / Web crawlers / World Wide Web / Consistent hashing / Robots exclusion standard / Hash function / Hash table / Cryptographic hash function / Hashing / Information science / Search algorithms

UbiCrawler: A Scalable Fully Distributed Web Crawler. Paolo Boldi, Bruno Codenotti, Massimo Santini

Source URL: vigna.di.unimi.it

Language: English - Date: 2003-10-19 11:16:51